
Add logs and extend metrics for graphql_validation_mode: both #3674

Merged
merged 13 commits into dev from renee/log-query-validation-errors
Sep 8, 2023

Conversation

goto-bus-stop
Member

@goto-bus-stop goto-bus-stop commented Aug 25, 2023

This adds logging for query validation errors with either Rust or JS when there is a mismatch, i.e. one of them validates but the other does not. In other cases we are not really interested in the specific error (it will just go back to the user), so we don't need to log there.
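The mismatch-only logging described above can be sketched in a few lines. This is a hypothetical illustration, not the router's actual code: `Validation` and `log_if_mismatch` stand in for the real validation outputs, and the real implementation would log through `tracing` rather than stderr.

```rust
// Hypothetical sketch: log only when the two validators disagree.
// Agreement (both pass, or both fail) needs no log, since a shared
// failure just goes back to the user anyway.

#[derive(Debug)]
enum Validation {
    Ok,
    Err(String),
}

/// Returns true (and logs) only when Rust and JS validation disagree.
fn log_if_mismatch(rust: &Validation, js: &Validation) -> bool {
    let rust_ok = matches!(rust, Validation::Ok);
    let js_ok = matches!(js, Validation::Ok);
    if rust_ok != js_ok {
        // In the router this would go through `tracing`, not stderr.
        eprintln!("validation mismatch: rust={:?}, js={:?}", rust, js);
    }
    rust_ok != js_ok
}
```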

To log the Rust validation error well, I now store the ApolloDiagnostics that were produced on `Query`. `Query` is serializable for caching, but `ApolloDiagnostic` is not. Here I just skipped serializing `ApolloDiagnostic`, so if a `Query` is loaded from cache, it does not have the validation error stored. I'm not sure this is the right thing to do. The ApolloDiagnostics are later used after query planning (which may produce a JS validation error). So it's correct if we can more or less safely assume that we only have valid `Query` instances cached. Otherwise we might get spurious error logs from this.

  • So is that a safe assumption? Reading the CachingQueryPlanner implementation, I think it only stores errors (in which case there is no `Query` instance) and fully successful planning results (in which case both Rust and JS validation have already run). So it looks fine, but relying on this could be a bit brittle.
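The caching behaviour in question can be sketched with std only. This is an illustrative model, not the router's real serde-based serialization: the point is that the diagnostics field is deliberately dropped on the way into the cache, so a `Query` loaded from cache never carries a stored validation error.

```rust
// Illustrative sketch of the cache round-trip. `diagnostics` stands in
// for the non-serializable ApolloDiagnostic and is dropped on write.

#[derive(Debug, Clone)]
struct Query {
    body: String,
    /// Never written to the cache (mirrors `#[serde(skip)]`).
    diagnostics: Option<Vec<String>>,
}

impl Query {
    /// Serialize for the cache, skipping `diagnostics`.
    fn to_cache(&self) -> String {
        self.body.clone()
    }

    /// Deserialize from the cache; diagnostics are always absent here,
    /// which is only correct if cached queries are always valid.
    fn from_cache(cached: &str) -> Self {
        Query {
            body: cached.to_string(),
            diagnostics: None,
        }
    }
}
```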

I also simplified the validation error printing, which:

- [x] depends on apollographql/apollo-rs#630.
- [x] and on #3675

Closes #3681

@github-actions
Contributor

@goto-bus-stop, please consider creating a changeset entry in /.changesets/. These instructions describe the process and tooling.

@router-perf

router-perf bot commented Aug 25, 2023

CI performance tests

  • step - Basic stress test that steps up the number of users over time
  • events_without_dedup - Stress test for events with a lot of users and deduplication DISABLED
  • xlarge-request - Stress test with 10 MB request payload
  • xxlarge-request - Stress test with 100 MB request payload
  • events_big_cap_high_rate - Stress test for events with a lot of users, deduplication enabled and high rate event with a big queue capacity
  • const - Basic stress test that runs with a constant number of users
  • reload - Reload test over a long period of time at a constant rate of users
  • large-request - Stress test with a 1 MB request payload
  • events - Stress test for events with a lot of users and deduplication ENABLED
  • step-jemalloc-tuning - Clone of the basic stress test for jemalloc tuning
  • no-graphos - Basic stress test, no GraphOS.

@Geal Geal requested review from a team, garypen, Geal, bnjjj and SimonSapin and removed request for garypen August 30, 2023 14:21
@abernix abernix removed the request for review from bnjjj September 1, 2023 08:23

```rust
self.errors.iter().for_each(|err| {
    // Outputs a pretty colourised report on TTYs
    eprintln!("{err}");
});
```
Contributor

We should really gate this entire function to test scope. We should never call println or variants in prod code. Otherwise if this is something that should be used then it should use tracing.
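A std-only sketch of the gating this comment suggests. The names here are illustrative, and the production path would go through the `tracing` crate (omitted so the sketch stays dependency-free):

```rust
// Sketch: keep the pretty stderr report out of production code paths
// by compiling it only for tests; production callers get the raw
// messages to route through `tracing` (or to bubble up, as the PR
// ended up doing).

struct Diagnostics {
    errors: Vec<String>,
}

impl Diagnostics {
    /// Pretty report on stderr: compiled only under `cfg(test)`, so
    /// no `eprintln!` can reach a production build.
    #[cfg(test)]
    fn print_report(&self) {
        for err in &self.errors {
            eprintln!("{err}");
        }
    }

    /// Production path: hand the messages to the caller, who logs
    /// them via `tracing::error!` or returns them in an error.
    fn messages(&self) -> Vec<String> {
        self.errors.clone()
    }
}
```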

Member Author

Makes sense! I've removed this function as it is only used in one place (supergraph schema validation error) and that already bubbles up the error. The bubbling up lost the actual underlying error message as described in #3681, so I've changed that in 525f376.

@abernix abernix requested a review from BrynCooke September 8, 2023 08:25
@goto-bus-stop goto-bus-stop changed the title from "Log query validation errors when graphql_validation_mode: both" to "Add logs and extend metrics for graphql_validation_mode: both" Sep 8, 2023
BrynCooke
BrynCooke approved these changes Sep 8, 2023
@goto-bus-stop goto-bus-stop merged commit b6164b3 into dev Sep 8, 2023
@goto-bus-stop goto-bus-stop deleted the renee/log-query-validation-errors branch September 8, 2023 13:03
garypen pushed a commit that referenced this pull request Sep 12, 2023
Successfully merging this pull request may close these issues.

Failure to parse schema does not output any information on where the parse failure occurred
3 participants